Model Selection

Image Classification

# Image Classification

Medai Resnet50 Brain

ResNet-50 is a deep residual network developed by Microsoft Research, widely used for image classification tasks.

Image Classification

Cat Dog Root Me

An image classification model built with PyTorch and HuggingPics, capable of accurately distinguishing between pictures of cats and dogs.

Image Classification

Plant Identification Vit

A plant identification model fine-tuned based on Google Vision Transformer (ViT) architecture, achieving 80.96% accuracy on the evaluation set

Image Classification

Utkface Race Classifications

This model is a fine-tuned version of microsoft/resnet-50 on an unknown dataset, primarily used for image classification tasks, achieving an accuracy of 84.86% on the evaluation set.

Image Classification

Kat Tiny Patch16 224.vitft

KAT is a novel vision model that replaces the traditional Transformer's channel mixer with Grouped Rational Kolmogorov-Arnold Networks (GR-KAN), trained on the ImageNet-1k dataset.

Image Classification

Font Identifier

A font recognition model fine-tuned on ResNet-18, achieving 78.1% accuracy on the test set

Image Classification

Font Identifier

A fine-tuned ResNet18 model for font recognition, capable of identifying 48 standard fonts with a test accuracy of 96.33%

Image Classification

Transformers English

Birds Classifier EfficientNetB2

A bird image classifier fine-tuned on EfficientNet-B2, capable of recognizing 525 bird species with up to 99% accuracy

Image Classification

Resnet18 Catdog Classifier

A fine-tuned cat-dog image classification model based on ResNet-18, trained on the Kaggle Cats and Dogs dataset with an accuracy of 99.29%

Image Classification

Transformers English

Organoids Prova Organoid

This model is a fine-tuned image classification model based on Google's ViT-base-patch16-224 on an image folder dataset, achieving an accuracy of 85.76% on the evaluation set.

Image Classification

Image classification model generated by HuggingPics, capable of identifying different dog breeds

Image Classification

Pyramid Vision Transformer (PVT) is a vision model based on transformer architecture, specifically designed for image classification tasks.

Image Classification

A vision model fine-tuned based on google/vit-base-patch16-224, suitable for image classification tasks

Image Classification

Vit Base Letter

An image classification model fine-tuned on a letter recognition dataset based on Google's ViT base model, achieving 98.81% accuracy

Image Classification

Transformers English

Face Discriminator

A face classification model fine-tuned based on Microsoft ResNet-50, achieving 99.84% accuracy on the validation set

Image Classification

Microsoft Swin Tiny Patch4 Window7 224 Ov

This is the OpenVINO version converted from the microsoft/swin-tiny-patch4-window7-224 model, designed to accelerate image classification inference.

Image Classification

Transformers English

Doge is an image classification model generated by HuggingPics, specifically designed to recognize Doge-related images.

Image Classification

Swin Tiny Patch4 Window7 224 Isl Finetuned

A vision model fine-tuned based on microsoft/swin-tiny-patch4-window7-224, achieving 100% accuracy on the evaluation set

Image Classification

Fl Image Category Multi Label

This is an image classification model fine-tuned based on Google's ViT model, trained on the fl_image_category_ds dataset with an accuracy of 66.22%.

Image Classification

Vit Artworkclassifier

Art style classification model based on ViT architecture, capable of identifying the art style category of input images

Image Classification

Fl Image Category

An image classification model fine-tuned based on microsoft/resnet-18, trained on the fl_image_category_ds dataset

Image Classification

A ViT model fine-tuned on the preprocessed 1024 configuration dataset for image classification tasks

Image Classification

Vit Base Patch16 224 Finetuned Algae Wirs

This model is a vision classification model fine-tuned on an algae dataset based on Google's ViT model, primarily used for algae image classification tasks.

Image Classification

An image classification model fine-tuned based on microsoft/resnet-50, achieving an accuracy of 64.1% on the evaluation set

Image Classification

A vision classification model fine-tuned based on google/vit-base-patch16-224 for recognizing first-generation Pokémon

Image Classification

A simple image classification model based on PyTorch and HuggingPics, used to determine whether a person in an image is bald.

Image Classification

Yolo V8 Fog Or Smog Classification

An image classification model based on YOLOv8 for identifying fog and smoke scenes.

Image Classification

Vision Transformer model based on ViT architecture for gender and age classification tasks

Image Classification

Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013 7e 05 Finetuned SFEW 7e 05

An image classification model based on the BEiT architecture, fine-tuned on the FER2013 dataset for facial expression recognition

Image Classification

Beit Base Patch16 224 Pt22k Ft22k Finetuned FER2013CKPlus

This model is an image classification model based on the BEiT architecture, fine-tuned on the FER2013CKPlus dataset for facial expression recognition tasks.

Image Classification

Efficientformer L3 300

EfficientFormer-L3 is a lightweight vision Transformer model developed by Snap Research, optimized for mobile devices to achieve low latency while maintaining high performance.

Image Classification English

Swin Small Finetuned Cifar100

A small model based on the Swin Transformer architecture, fine-tuned on the CIFAR-100 dataset for image classification tasks

Image Classification

Efficientformer L1 300

EfficientFormer-L1 is a vision Transformer model developed by Snap Research, optimized for mobile devices to achieve extremely low latency while maintaining high performance.

Image Classification English

Swin Tiny Finetuned Cifar100

Image classification model fine-tuned on CIFAR-100 dataset based on Swin Transformer Tiny architecture

Image Classification

Vit Base Patch16 224 In21k Finetuned Cifar10 Test

A fine-tuned test version of Google Vision Transformer (ViT) base model on CIFAR-10 dataset

Image Classification

Vit Hybrid Base Bit 384

The Hybrid Vision Transformer (ViT) model combines convolutional networks and Transformer architectures for image classification tasks, excelling on ImageNet.

Image Classification

An image classification model based on ViT architecture, fine-tuned on an image folder dataset

Image Classification

3d Printed Or Not

This is an image classification model used to determine whether an image is of a 3D-printed object.

Image Classification English

Vit Base Patch16 224 In21k Finetuned Cifar10 Album Vitvmmrdb Make Model Album Pred

A Vision Transformer (ViT) based model fine-tuned on the CIFAR-10 dataset for image classification tasks

Image Classification

Image Spam Detection Keras2

This is a model based on the Keras framework, with unspecified functionality, possibly used for image classification or spam detection tasks

Text Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase